Using Annotations from Controlled Vocabularies to Find Meaningful Associations

نویسندگان

  • Adam Woei-Jyh Lee
  • Louiqa Raschid
  • Padmini Srinivasan
  • Nigam H. Shah
  • Daniel L. Rubin
  • Natalya Fridman Noy
چکیده

This paper presents the LSLink (or Life Science Link) methodology that provides users with a set of tools to explore the rich Web of interconnected and annotated objects in multiple repositories, and to identify meaningful associations. Consider a physical link between objects in two repositories, where each of the objects is annotated with controlled vocabulary (CV) terms from two ontologies. Using a set of LSLink instances generated from a background dataset of knowledge we identify associations between pairs of CV terms that are potentially significant and may lead to new knowledge. We develop an approach based on the logarithm of the odds (LOD) to determine a confidence and support in the associations between pairs of CV terms. Using a case study of Entrez Gene objects annotated with GO terms linked to PubMed objects annotated with MeSH terms, we describe a user validation and analysis task to explore potentially significant associations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Framework for Discovering Associations from the Annotated Biological Web

During the last decade, biomedical researchers gained access to the entire human genome, reliable high-throughput biotechnologies, and affordable computational resources and network access. In combination, these new tools created a new model for biomedical research that no longer uses computational tools merely to monitor research, but instead exploits these tools to acquire knowledge and make ...

متن کامل

A Framework for Discovering Meaningful Associations in the Annotated Life Sciences Web

Title of dissertation: A FRAMEWORK FOR DISCOVERING MEANINGFUL ASSOCIATIONS IN THE ANNOTATED LIFE SCIENCES WEB Woei-Jyh (Adam) Lee, Doctor of Philosophy, 2009 Dissertation directed by: Professor Louiqa Raschid Department of Computer Science During the last decade, life sciences researchers have gained access to the entire human genome, reliable high-throughput biotechnologies, affordable computa...

متن کامل

Using Gene Ontology and genomic controlled vocabularies to analyze high-throughput gene lists: Three tool comparison

In genomic and molecular biology domains, controlled vocabularies and ontologies are becoming of paramount importance to integrate and correlate the massive amount of information increasingly accumulating in heterogeneous and distributed databanks. Although at present they are still few and present some issues, they can effectively be used also to biologically annotate genes on a genomic scale ...

متن کامل

بررسی میزان همخوانی عبارت‌های جستجوی کاربران با اصطلاحات پیشنهادی مقالات در پیشینه‌های کتابشناختی پایگاه‌های اطلاعاتی لاتین EBSCO و IEEE

Purpose: This study aims to investigate correspondence of users' queries with alternative terms of Latin databases namely IEEE and EBSCO. Databases display subjective content of their documents through natural or controlled language vocabularies in specified bibliographic fields along with other bibliographic information that are called papers alternative terms. Methodology: We used content an...

متن کامل

An Algorithm for Generating Representative Functional Annotations Based on Gene Ontology

The authors address the issue of providing highly representative descriptions in automated functional annotations. For an uncharacterized sequence, a common strategy is to infer such annotations from those of well-characterized sequences that contain its homologues. However, under many circumstances, this strategy fails to produce meaningful annotations. Using information revealed by the struct...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007